The pure parsimony haplotyping problem: overview and computational advances

نویسندگان

  • Daniele Catanzaro
  • Martine Labbé
چکیده

Haplotyping estimation from aligned Single Nucleotide Polymorphism (SNP) fragments has attracted more and more attention in the recent years due to its importance in analysis of many fine-scale genetic data. Its application fields range from mapping of complex disease genes to inferring population histories, passing through designing drugs, functional genomics and pharmacogenetics. The literature proposes a number of estimation criteria to select a set of haplotypes among possibles alternatives. Usually, such criteria can be expressed under the form of objective functions, and the sets of haplotypes that optimize them are referred to as optimal. One of the most important estimation criteria is the pure parsimony which states that the optimal set of haplotypes for a given set of genotypes is the one having minimal cardinality. Finding the minimal number of haplotypes necessary to explain a given set of genotypes involves solving an optimization problem, called the Pure Parsimony Haplotyping (PPH) estimation problem, which is notoriously NP-Hard. This article provides an overview of PPH, and discusses the different approaches to solution that occur in the literature.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

1 Haplotype Inference

Fresh Pond Research Institute 1.1 Abstract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-1 1.2 Introduction to Variation, SNPs, Genotypes, and Haplotypes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1-2 The Biological Problem • The Computational Problems • The Need for a Genetic Model • Two Major Ap...

متن کامل

Approximation algorithms for the minimum rainbow subgraph problem

Our research was motivated by the pure parsimony haplotyping problem: Given a set G of genotypes, the haplotyping problem consists in finding a set H of haplotypes that explains G. In the pure parsimony haplotyping problem (PPH) we are interested in finding a set H of smallest possible cardinality. The pure parsimony haplotyping problem can be described as a graph colouring problem as follows: ...

متن کامل

Computational Experiments on Algorithms for Haplotype Inference Problems by Pure Parsimony

To analyze the function of DNA, researchers have to obtain each haplotype, the genetic constitution of an individual chromosome, of an individual for analysis. Due to the significant efforts required in collecting haplotypes, the descriptions of one conflated pair of haplotypes called genotypes are usually collected. Since the genotype data contains insufficient information to identify the comb...

متن کامل

A Class Representative Model for Pure Parsimony Haplotyping

Haplotyping estimation from aligned Single Nucleotide Polymorphism (SNP) fragments has attracted more and more attention in the recent years due to its importance in analysis of many fine-scale genetic data. Its application fields range from mapping of complex disease genes to inferring population histories, passing through designing drugs, functional genomics and pharmacogenetics. The literatu...

متن کامل

A Class Representative Model for Pure Parsimony Haplotyping under Uncertain Data

The Pure Parsimony Haplotyping (PPH) problem is a NP-hard combinatorial optimization problem that consists of finding the minimum number of haplotypes necessary to explain a given set of genotypes. PPH has attracted more and more attention in recent years due to its importance in analysis of many fine-scale genetic data. Its application fields range from mapping complex disease genes to inferri...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • ITOR

دوره 16  شماره 

صفحات  -

تاریخ انتشار 2009